TNL: NUMERICAL LIBRARY FOR MODERN PARALLEL ARCHITECTURES

نویسندگان

چکیده

We present Template Numerical Library (TNL, www.tnl-project.org) with native support of modern parallel architectures like multi–core CPUs and GPUs. The library offers an abstract layer for accessing these via unified interface tailored easy fast development high-performance algorithms numerical solvers. is written in C++ it benefits from template meta–programming techniques. In this paper, we the most important data structures TNL together scalability on speed–up GPUs supporting CUDA.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A numerical differentiation library exploiting parallel architectures

Article history: Received 7 July 2008 Received in revised form 15 January 2009 Accepted 10 February 2009 Available online xxxx PACS: 02.60.Jh 02.60.Pn 02.70.Bf

متن کامل

A Parallel Numerical Library for UPC

Unified Parallel C (UPC) is a Partitioned Global Address Space (PGAS) language that exhibits high performance and portability on a broad class of shared and distributed memory parallel architectures. This paper describes the design and implementation of a parallel numerical library for UPC built on top of the sequential BLAS routines. The developed library exploits the particularities of the PG...

متن کامل

NDL-v2.0: A new version of the numerical differentiation library for parallel architectures

Wepresent a new version of the numerical differentiation library (NDL) used for the numerical estimation of first and second order partial derivatives of a function by finite differencing. In this version we have restructured the serial implementation of the code so as to achieve optimal task-based parallelization. The pure shared-memory parallelization of the library has been based on the ligh...

متن کامل

Parleda: a Library for Parallel Processing in Computational Geometry Applications

ParLeda is a software library that provides the basic primitives needed for parallel implementation of computational geometry applications. It can also be used in implementing a parallel application that uses geometric data structures. The parallel model that we use is based on a new heterogeneous parallel model named HBSP, which is based on BSP and is introduced here. ParLeda uses two main lib...

متن کامل

Efficiency improvements of iterative numerical algorithms on modern architectures

For many numerical codes the transport of data from main memory to the registers is commonly considered to be the main limiting factor to achieve high performance on present micro architectures. This fact is referred to as the memory wall. A lot of research is targeting this point on different levels. This covers for example code transformations and architecture aware data structures to achieve...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Acta Polytechnica

سال: 2021

ISSN: ['1210-2709', '1805-2363']

DOI: https://doi.org/10.14311/ap.2021.61.0122